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Abstract — In this paper, we consider noncoherent random lin- 
ear coding networks (RLCNs) as a discrete memoryless channel 
(DMC) whose input and output alphabets consist of subspaces. 
This contrasts with previous channel models in the literature 
which assume matrices as the channel input and output. No 
particular assumptions are made on the network topology or 
the transfer matrix, except that the latter may be rank-deficient 
according to some rank deficiency probability distribution. We 
introduce a random vector basis selection procedure which 
renders the DMC symmetric. The capacity we derive can be 
seen as a lower bound on the capacity of noncoherent RLCNs, 
where subspace coding suffices to achieve this bound. 

I. Introduction 

Random linear network coding (RLNC), proposed by Ho et 
al. 0, is a very attractive way to obtain the benefits of linear 
network coding [2|, |3| in dynamically changing or large scale 
networks. In a decentralized way, by requiring the intermediate 
nodes to randomly combine incoming packets over a finite 
field, RLNC achieves the network min-cut with probability 
approaching one as the field size goes to infinity. For this 
reason, RLNC is an important research topic and has recently 
received a great deal of attention. Following JU, we call a 
network employing RLNC a random linear coding network 
(RLCN). 

In order to recover the transmitted packets, a receiving node 
needs to know its associated network transfer matrix. In reality, 
the transfer matrix depends on the network topology and the 
network code used. However, to allow for some analysis, 
in previous works on RLCNs authors have made different 
assumptions on the characteristics of the transfer matrix, which 
has been assumed to have full rank 0, 0, bounded rank 
deficiency [7], or entries selected uniformly at random 0, 
0. Also, it has been considered as being selected either by 
an adversary or uniformly at random among all nonsingular 
matrices 0. 

In some works, the approach for the capacity of RLCNs 
is combinatorial. For example, in iflOl . where the authors 
consider the noncoherent scenario in which the transfer matrix 
is assumed unknown to both the transmitter and the receiver, 
a subspace coding approach is prescribed and coding size 
bounds under a topology-independent operator channel model 
are provided. 



A probabilistic approach for the capacity of RLCNs was first 
presented by Montanari and Urbanke |6|, who have considered 
an additive perturbation (error) matrix chosen uniformly at 
random from all matrices of given dimension and rank. Later, 
their work was further extended by Silva et al. 0, who have 
considered both the transfer and the error matrices as chosen 
according to more general probability distributions. 

Recently, Yang and Yang [4] have called the attention to the 
importance of understanding how network topologies affect 
RLNC. More specifically, they studied a particular family of 
unicast networks and identified a worst-case unicast network 
topology: the S-networks. They derived a capacity formula for 
S-networks, and showed that for this topology subspace coding 
is optimal in the sense that it suffices to achieve capacity. 
Similar conclusions regarding the use of subspace coding were 
reached in 0, 0. However, in these works, RLCNs are seen 
as a channel not with subspaces but with matrices as inputs 
and outputs. 

In this paper, we consider the noncoherent scenario of ifTol 
and make no particular assumption on the network topology or 
the network code used. Since the transmission strategy using 
subspaces is oblivious to the underlying network topology 
and to the particular linear combinations performed at the 
network nodes IflOl . and since no other coding approach 
seems more suitable to the noncoherent scenario, we begin 
by treating RLCNs as a discrete memoryless channel (DMC) 
with subspaces as inputs and outputs. We build up a channel 
model for RLCNs that depends solely on the dimension of 
the subspaces that constitute the input alphabet and the rank 
deficiency probability distribution associated with the RLCN. 
The problem is that it is difficult to characterize such channel 
because, as we will see, under certain conditions, the received 
subspace depends on the choice of the transmitted matrix. 
Also, due to the many forms the transfer matrix can take, 
it is difficult, if not impossible, to characterize this channel 
simply from the rank deficiency probability distribution. 

We overcome these drawbacks by introducing a random 
basis selector, a device which selects a matrix uniformly at 
random from the set of all ordered bases for the subspace. 
This idea has been mentioned in ifToll as a way to saying 
that in subspace coding there is no benefit in choosing a 
particular basis for transmitting a given subspace. A similar 



randomization procedure is also followed in J6), with the 
purpose of facilitating the decoding algorithm. Completely 
innocuous from the subspace coding perspective, this random 
basis selection plays a fundamental role in the characterization 
of the channel and in the derivation of its capacity, as we show 
in the present paper. 

Interestingly, the capacity formula we derive exactly corre- 
sponds to the one in [4|, which has been derived for a specific 
network topology. Our conclusions regarding noncoherent RL- 
CNs parallel those in (0) : while Yang and Yang conclude that 
the capacity of unicast networks with given topology is lower 
bounded by the capacity of S-networks, we conclude that 
the same lower bound applies to the capacity of noncoherent 
RLCNs for general (unknown) topology, and that subspace 
coding is sufficient to achieve this bound. 

The paper is organized as follows. In the next section, we 
describe the system model and review some important con- 
cepts for the forthcoming derivations. In Section|lIIl where the 
main contribution of this work is presented, we characterize 
the subspace channel model for RLCNs and derive its channel 
capacity. Finally, in Section [IV] we present our conclusions 
and final comments. 

II. Preliminaries and Notation 

In this paper, random entities are represented using boldface 
letters, while italic letters are used for their samples. However, 
the letters O, U, and V are reserved to denote subspaces. 

A. Random Linear Coding Networks as a Matrix Channel 

Consider a time-varying network with one source node 
and one sink node (unicast), with the intermediate nodes 
performing uniform random linear combining over a finite 
field W q , where q is a prime power. At time slot t, the matrix 
channel for this network is given by 

Y(f) = G(i)X(t), (1) 

where X(i) is the m x T transmitted matrix and Y(i) is the nx 
T received matrix. The rows of X(t) (resp., Y(i)) represent 
the m (resp. ri) transmitted (resp., received) packets, which are 
seen as vectors over W q . G(t) is the network transfer matrix, 
an n x in random matrix over W q whose entries depend on 
the network topology and the network code used. The time 
slot is the time during which G(t) remains constant, and also 
corresponds to T uses of the network, where T is the packet 
size. In this paper, we assume that both the transfer matrix 
and the network topology are unknown to both the transmitter 
and the receiver. Following Q, J6], however, we assume for 
simplicity a square transfer matrix with known dimension m = 
n = h fixed at all times. 

Due to unfortunate selection of random coefficients, the 
network transfer matrix may be rank-deficient. According to 
the channel model in (Q]i, the rank deficiency of G(t), herein 
denoted as tq (i), is a discrete-time stochastic process. We 
assume that tg (t) is first-order stationary so that a probability 
distribution of Ta(t), which is independent of t, exists and is 
given by p^ G (r), where r — 0, . . . , h. 



In practice, if this process is ergodic, its probability dis- 
tribution can be estimated at the sink node by continuously 
updating a histogram based on the rank deficiencjfl of the 
received matrix Y(t). The probability distribution can then be 
communicated to the source node through a low-rate feedback 
channel, either from time to time or once and for all when a 
steady distribution is reached. From now on, we will drop the 
time index t in all variables for convenience. 

B. Subspace Coding 

The idea behind subspace coding is that, since the received 
vectors are linear combinations of the transmitted vectors, the 
vector space spanned by the transmitted vectors is preserved. 
So, in subspace coding, information is encoded in the choice 
of a subspace of the vector space iflOl . We denote by (X) 
the subspace spanned by the rows of the transmitted matrix X. 

A subspace code is then a subset of the projective space 
■p(IF^), which is the collection of all subspaces of TF^ . A 
codeword of a subspace code is one of these subspaces. This 
contrasts with the notion of classical linear block coding, 
where a code is a subspace and a codeword is a vector in 
this subspace. 

Most known subspace codes are constant-dimension codes, 
or codes in the Grassmannian, i.e., subspace codes whose all 
codewords have the same dimension. We denote the Grass- 
mannian of dimension h by V{W^, h), so that P(W ) — 
{J h=0 V(W q , h). The number of distinct ^-dimensional sub- 
spaces of an n-dimensional vector space over IF g is given by 

It) = II q e-i _ i ' 
\ ' q i=o H 

known as the q-ary Gaussian coefficient. 

Although the use of subspaces of multiple dimensions is 
optimal under certain circumstances |9|, there has been an 
increasing interest in constant-dimension codes, chiefly due to 
their simpler encoding and decoding algorithms. 

In Q, Silva et al. have given an important construction of 
constant-dimension codes which builds upon a lifting of a class 
of maximum rank-metric codes due to Gabidulin 111 II . More 
recently, in fl2l . Silva and Kschischang have presented new 
fast encoding and decoding algorithms for Gabidulin codes 
that can be used towards their lifted counterpart. 

C. Discrete Memoryless Channels 

A DMC is defined by the triplet (X,py\^,y), where X 
(resp., y) is the channel input (resp., output) alphabet, and 
PyixO'I-X') is the conditional probability that Y G y is 
received given that X G X is sent. The channel is memoryless 
in the sense that what happens to the transmitted symbol at 
one time is independent of what happens to the transmitted 
symbol at any other time. Let X and Y be the random entities 
representing the input and output symbols, respectively. The 

'Recall that, if X is full-rank then t y = t g if Y = GX. 



channel capacity of the DMC is then given by 

C = max /(X; Y) (2) 

= max HCK) - H(X\Y) = max H(Y) - H(Y\X), 

Px(X) p x (X) 

where 7(X;Y), H(X) (resp., ff(Y)), and i?(X|Y) (resp., 
ff(Y|X)) are the mutual information between X and Y, the 
entropy of X (resp., Y), and the conditional entropy of Y 
(resp., X) given X (resp., Y), respectively. The maximization 
in (O is over all possible input distributions px- 

The points of the distribution Py\x(Y\X) for all X e X 
and all Y € y can be arranged in a matrix, called the transition 
probability matrix, where the X's (resp., Y's) are related to 
the rows (resp., columns). 

A DMC can be pictorially described by a bipartite graph, 
where the nodes at left (resp., right) represent the input 
(resp., output) alphabet. Each input (resp., output) node has 
a number of sockets from (resp., to) which edges emanate 
(resp., connect). If an output symbol occurs with some nonzero 
probability given an input symbol, then there is an edge 
connecting the corresponding input node to the output node. 
Of course, the total number of sockets on the X side equals 
the total number o sockets on the y side. 

The following definitions and theorems have been adapted 
from IB], Ifl4l . 

Definition 1: A DMC is uniformly dispersive (resp., focus- 
ing) if all the rows (resp., columns) of the transition probability 
matrix are permutations of each other. A DMC that is both 
uniformly dispersive and uniformly focusing is said to be 
strongly symmetric. 

Theorem 1: For a strongly symmetric DMC, the capacity 
is given by 

c = \o g \y\-H(p), 

where H(p) denotes the entropy of the probability vector p, 
formed by the elements of a row of the transition probability 
matrix. Moreover, this capacity is achieved by the uniform 
input distribution. 

If a channel is uniformly dispersive but not uniformly 
focusing, it may still have a weaker form of symmetry. 

Definition 2: A uniformly dispersive DMC is said to be 
symmetric (also known as component-symmetric) if, for some 
L, it can be decomposed into L strongly symmetric channels 
with selection probabilities qo, . . . , q^-i- 

Theorem 2: For a symmetric DMC, the capacity is given by 

L-l 
i=0 

where Cj is the capacity of the i-th component strongly 
symmetric channel associated with the symmetric DMC. 

III. The Capacity of the Subspace Channel as a 
Symmetric DMC 

We want to characterize the DMC 

(x=V(wT,h), P r Y)m ,y=\JV(wT,r) ) j 



serving as a channel model for RLCNs essentially without any 
assumption on the network topology or the particular network 
code used. The main goal is to characterize the conditional 
probability p/Y)|(x) mat tne subspace (Y) is received when 
subspace (X) is sent as a function of the rank deficiency 
probability distribution pr G (r). The difficulties related to 
the characterization of this DMC, raised in the introductory 
section, will be illustrated by the next two examples. 

Example 1: Let U = {000,010,100,110} be the input 
subspace. There are six different basis matrices for U, two 
of which are 



X = 
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,x' = 
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Suppose that G has rank deficiency tg = 1 and is given by 



For the corresponding received matrices, Y — GX and 
Y = GX' , the associated one-dimensional output subspaces 
are (Y) = {000, 100} and (Y) = {000, 110}, respectively. 

We can see from Example 1 that, given the input subspace U 
and a fixed non-zero, rank-deficient transfer matrix G, the 
output subspace may depend on the selection of the basis for 
U. The first difficulty resides on the fact that a different DMC 
results for each possible combination of bases selections for 
input subspaces, and there is an exponentially large number 
of them. 

Characterizing our DMC is also difficult because the net- 
work code can affect the transmitted matrix X (and the input 
subspace (X)) in too many ways. The next example illustrates 
the problem. 

Example 2: Consider the same input subspace U in Exam- 
ple 1 and X 1 as the transmitted matrix. Also, suppose again 
that tg — 1 but with the transfer matrix given by 



The received matrix, Y" = G'X, gives rise to the output 
subspace (Y") = {000,010}. 

We can see from Example 2 that, for a fixed transmitted 
matrix X, different transfer matrices with the same rank 
deficiency may produce different output subspaces. (Note that 
this problem does not exist if the distinct transfer matrices have 
full rank.) As a result, for a fixed rank deficiency probability 
distribution the DMC is not fixed, but varies according to the 
realization of G. In other words, a DMC for the subspace 
channel cannot be defined solely from rank deficiencies even 
if a fixed mapping U — > X with (X) = U were specified for 
every subspace U in the Grassmannian. 

We remedy these problems by introducing a random basis 
selector. We show that, upon using a basis matrix X produced 
by this device, the resulting DMC represents RLCNs, and is 
both well-defined and consistent with the subspace coding 
approach. Given a subspace U G X at the input of the 
selector, it outputs a matrix X which is selected, uniformly 



at random, from the set of all basis matrices whose rows span 
the subspace U. Accordingly, the conditional probability of 
basis matrix X given the subspace (X) = U is given by 



of U, which is given by 



P W ){X\U) =\{{q h - 



(3) 



where the inverse of (0 is exactly the number of ordered bases 
that span a subspace U in V(W^ , h) ifTOl . From now on, when 
we refer to a DMC we mean the DMC that models RLCNs 
under this random basis selection. 

We now prove that the distribution P(y)|(x) is fully char- 
acterized by the rank deficiency probability distribution only, 
and is conditionally independent of the network topology and 
the network code used. 

Theorem 3: For the DMC 

h 
r=0 

the transition probability distribution is given by 

Pr G (h — dim V) 

P<y)|<x)(V|E0 = 



h 

dimV 



if V C U 



0, 



else. 



Proof: We begin by expanding the conditional probability 
distribution p^ Y > | (x),t G as follows: 

P{-Y)\(x),r G (V\U,p) 

= J2 p (v),x\(x)--g( v > x \ u >p) 

x 

= ^2px\{X),t g {X\U,p) ■p(Y)\X,{X.),r a {V\X,U,p) 
X 

= E Px|<x>W)-P<y>|x,t g (Wp) 

X:{X) = U 
h 

= n^-^ 1 )^ 1 E P(Y)\x,r G (V\X,p), (4) 

i=l X:(X)=U 

where we used the fact that X is conditionally independent of 
?g given (X) and the result in (O forp X |(x)- The summation 
in (0| can be obtained as follows. Suppose that G is some fixed 
(unknown) matrix G over W q with tq = p = h— dim V. Then 
the transmitted matrix X is mapped into some output subspace 
(GX) = V' with probability one. So, the summand in is 
either 1 (if V' = V) or zero (if V' ^ V). The resulting sum is 
thus the number of ordered bases X that span the subspace U 
in ^(IF^, h) and that produce (GX) = V as output subspace. 
Since the range of X is the set of all ordered bases that span 
the subspace U, by symmetry, the summation in is exactly 
the total number of ordered bases that span the subspace U 
divided by the total number of subspaces of dimension dim V 



( h 



With this result, becomes 



P<Y}|<X),r G (y|^,P) 
1 




, if V CU,p = h-dimV 



(5) 



We now write 

P(Y)\{X)(V\U) 
h 

= E p < Y >K x >.~c ; ( v 1 t/ ' r ') -Pr G (r) 

r=0 

= P{Y)\(x).- G (V\U,h- dimV) ■ pr G (h - dimV), 
which completes the proof. 

■ 

From Theorem [3] we should note that the conditional prob- 
ability P(y)|(x) (V\U) depends on the dimension of V (but not 
on V itself) and is independent of U given that V C U. Since 
the number of subspaces of dimension dim V of a subspace U 
of dimension h is independent of the specific subspace U, 
we can clearly see that our DMC is uniformly dispersive. 
However, it is not strongly symmetric in general. To see this, 
consider the conditional probability P(y}|(x) (^1^0 = Pr G (ty- 
The output subspace U of dimension h is reached by only one 
input subspace, namely, U itself. Consider now the conditional 
probability P(y)\(x)(0\U) = Pr G {h). The zero-dimensional 
output subspace O is reached by all input subspaces. From 
these two situations, we can clearly see that our DMC is not 
uniformly focusing, so it is not strongly symmetric in general. 
Nevertheless, we prove next an important result. 

Lemma 1: TheDMC {V{W T q , h), P(r )\(x), Ur=o^( F «> r )) 
is symmetric. 

Proof: There are h + 1 component channels, DMC r = 
(X r ,p r (y\x),y r ), r = 0,...,h, each one related to one of 
the possible rank deficiencies of the transfer matrix G. As 
explained above, for ?g = and for tg = h, the two com- 
ponent channels are DMCo = (^(IFj, h),po(y\x) = 1 if x = 
y,V{W T q ,h)) and DMC„ = (V(W^h),p h (y\x) = l,{0}), 
respectively. These channels are trivial and clearly strongly 
symmetric. Their capacities are 



C 



logV(W q ,h) = log 



and Ch = 0. Let us then assume that tg = p, where < p < 
h. The input alphabet is X p = VQFZ , h). In the bipartite graph 
representation of DMC P , only those sockets of the X side of 
DMC that are associated to the transitions to output subspaces 



of dimension h — p are considered for the X p side. From ((5]), 
we can see that the output alphabet is y p = V(TF^,h — p), 
and every input (resp., output) subspace U £ X p (resp., V £ 
y p ) is connected with equal transition probability to the same 
number of output (resp., input) subspaces. (The number of 
input subspaces connected to an output subspace V 6 y p can 
be obtained by dividing the number of sockets on the y p side, 
which is the same as the number of sockets on the X p side, i.e., 



h — p 



by the number of output subspaces, namely, ^(IF^, h — p)\.) 
This yields a component strongly symmetric channel, with 
capacity given by (from Theorem [U 



log\V(TF^h-p)\-log\V(TF h q ,h-p)\ 



log 



= log 



T 

h — p 

T 
h — p 



log 



h — p 



h 

h — p 



(6) 



Theorem 4: Given its rank deficiency probability distribu- 
tion Pf G (r), a RLCN seen as a subspace channel can be 
modeled as the DMC (T{W T ql h),p {Y)m , (jJU P(F£, r)). 
The capacity of this DMC is conditionally independent of the 
network topology and the network code used, and is given by 

T 



r=0 ( 

{h-r 

Proof: The proof follows straightforwardly from 
Lemma Q] and Theorem [2] with L = h + 1, <ft = pr G (i), and 
Ci given by ©. ■ 

IV. Conclusion and Final Comments 

In this paper, we have modeled random linear coding 
networks as a symmetric discrete memoryless channel, and we 
have derived the capacity of this channel. We have assumed 
that neither the transmitter nor the receiver has knowledge 
of the network transfer matrix (noncoherent scenario) or the 
network topology. This scenario differs from the one in |4|, 
where the network topology is assumed known and fixed at all 
times. For some fixed topologies, it has been shown in [4| that 
the capacity of RLCNs is lower bounded by the capacity of the 
so-called S-networks, for which constant-dimension subspace 
coding is optimal. 

Herein, we have tactically considered the more general 
situation as an attempt to complement their work, and to try 
to answer some of the open questions raised therein. From the 
results of the present paper, we have the following comments: 



1) The capacity in |4| is also a lower bound for the capac- 
ity of noncoherent RLCNs with unknown topology; if 
constant-dimension subspace coding is used, this lower 
bound is achieved; 

2) The capacity of noncoherent RLCNs, either with gen- 
eral, but fixed and known, or with unknown topology, 
and whether subspace coding (in contrast to matrix 
coding) is sufficient to achieve capacity is still unkown. 

As with any DMC, the capacity of RLCNs is achieved by 
using the channel many times. Since subspace codes are the 
most promising candidates for noncoherent RLNC, multishot 
subspace codes have been investigated in [15], [16|. 

Finally, the case of non-constant-dimension subspace coding 
and the inclusion of erroneous packets to our channel model 
are currently being investigated. 
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